🧠 LLM Inference · Specific · Quantization, Attention Mechanisms, Batch Processing, KV Caching
Scoured 29,430 posts in 67.0 ms

ScoutAttention: Efficient KV Cache Offloading via Layer-Ahead CPU Pre-computation for LLM Inference
🏗️ LLM Infrastructure · arxiv.org · 2d · …

Blog #0192: How Tokens Talk to Each Other
📊 Model Serving Economics · matthewsinclair.medium.com · 3d · …

Prefix caching for LLM inference optimization
💾 Prompt Caching · bentoml.com · 2d · Hacker News · …

Systematic Analysis of CPU-Induced Slowdowns in Multi-GPU LLM Inference (Georgia Tech)
🏗️ LLM Infrastructure · semiengineering.com · 5d · …

What is inference engineering? Deepdive
🏗️ LLM Infrastructure · newsletter.pragmaticengineer.com · 2d · …

alexziskind1/llm-inference-calculator
🏗️ LLM Infrastructure · github.com · 1d · …

Speculative Decoding: Performance or Illusion?
📊 Model Serving Economics · specdecode-bench.github.io · 6d · Hacker News · …

What if AI doesn’t need more RAM but better math?
🔬 RaBitQ · adlrocha.substack.com · 4d · Substack · …

From 300KB to 69KB per Token: How LLM Architectures Solve the KV Cache Problem
💾 Prompt Caching · news.future-shock.ai · 4d · Hacker News · …

BioInfo/dendrite: Agent-native inference engine with O(1) fork latency for tree-structured reasoning
🕯️ Candle · github.com · 3d · Hacker News · …

Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference
🏗️ LLM Infrastructure · arxiv.org · 1d · …

Efficient Inference of Large Vision Language Models
📦 Batch Embeddings · arxiv.org · 2d · …

Multiple-Prediction-Powered Inference
📦 Batch Embeddings · arxiv.org · 2d · …

ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing
🔢 BitNet Inference · arxiv.org · 2d · …

Robust Batch-Level Query Routing for Large Language Models under Cost and Capacity Constraints
🧠 Inference Serving · arxiv.org · 2d · …

MemBoost: A Memory-Boosted Framework for Cost-Aware LLM Inference
🏗️ LLM Infrastructure · arxiv.org · 3d · …

Tucker Attention: A generalization of approximate attention mechanisms
🎯 Deep Work · arxiv.org · 1d · …

Quantization with Unified Adaptive Distillation to enable multi-LoRA based one-for-all Generative Vision Models on edge
📱 Edge AI Optimization · arxiv.org · 1d · …

Rocks, Pebbles and Sand: Modality-aware Scheduling for Multimodal Large Language Model Inference
✨ Gemini · arxiv.org · 3d · …

SliderQuant: Accurate Post-Training Quantization for LLMs
🏗️ LLM Infrastructure · arxiv.org · 6d · …